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(54) Provision of continuous database service and scalable query performance using active 
redundant copies 



(57) A method and apparatus tor distributing com- 
puter resources in a network environment. A network ot 
computer systems is partitioned into at least one com- 
puting system partition., and is configured into at least 
one redundancy group. The computing system parti- 
tions include applications, computing system nodes, 
and copies ot a database schema. The copies of the 
database schema are replicated at each computing sys- 



tem partition within a network. The computing system 
partition manages interactions between the instances, 
the computing system nodes, and the copy of the data- 
base schema within the respective computing system 
partition. The redundancy group comprises at least one 
computing system and at a plurality of computing sys- 
tem partitions, and manages the replication of the data- 
base schema within the computing system and comput- 
ing system partitions. 
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Description 

[0001] The invention relates in general to computer 
systems, and more particularly, to a provision for con- 
tinuous database service using active redundant copies 
of the database. 

[0002] Many modern computer systems are in nearly 
continuous use, and have very little time to be taken 
"down" or "offline - for database updates or preventative 
maintenance. Further, computer systems increasingly 
require systems that virtually never fail and have little or 
no scheduled downtime. As a concurrent requirement, 
these same systems demand cost-effective computing 
solutions, open systems to avoid or reduce specific sup- 
plier dependencies, and the ability to leverage the latest 
hardware and software technologies as they become 
available. 

[0003] Modern computer systems also have transi- 
tioned from a static installation to a dynamic system that 
regularly changes. The system continually contains new 
collections of products that are processing requests 
from a constantly changing user base. The ability of 
computing solutions to provide service availability in a 
dynamic environment is becoming increasingly impor- 
tant, because the pace of change in products and cus- 
tomers' environments is expected to increase. The term 
"change tolerance" has been used to describe the ability 
of a computer system to adapt to the dynamic environ- 
ment required 

[0004] It can be seen, then, that there is a need in the 
art for a system that provides a high confidence level for 
continuous processing. It can also be seen, then, that 
there is a need in the art for a system with a high change 
tolerance. It can also be seen, then, that there is a need 
in the art for a system with reasonable development 
costs and implementation schedules that does not sac- 
rifice the benefits of open systems. 
[0005] It is the object of the present invention to pro- 
vide a system which overcomes the limitations in the pri- 
or art as described above; 

[0006] According to a first aspect, the present inven- 
tion discloses a system for providing continuous data- 
base access, comprising: 

at least one computing system partition, including 
at least one instance of an application, at least one 
computing system node, and at least one copy of a 
database schema, the copies of the database sche- 
ma being replicated at each computing system par- 
tition within a network, and wherein each computing 
system partition manages interactions between the 
instances, the computing system nodes, and the 
copy of the database schema within the respective 
computing system partition; 

a plurality of computing systems connected togeth- 
er via the network, wherein each computing system 
comprises one or more computing system parti- 
tions; 



at least one redundancy group, comprising at least 
one computing system and a plurality of computing 
system partitions, wherein each redundancy group 
manages the replication of the database schema 
s within the computing system and computing system 
partitions within the redundancy group. 

[0007] The computing systems are connected togeth- 
er via the network, and comprise one or more computing 

io system partitions. The redundancy group comprises at 
least one computing system and at a plurality of com- 
puting system partitions, and manages the replication 
of the database schema within the computing system 
and computing system partitions. 

is [0008] According to a second aspect, this present in- 
vention resides in a method for distributing computer re- 
sources in a network environment, comprising the steps 
of: 

20 assembling, as part of a computer network, at least 
one computing system partition, including at least 
one instance of an application, at least one comput- 
ing system node, and at least one copy of a data- 
base schema, the copies of the database schema 

2S being replicated at each computing system partition 
within the computer network; 

configuring, within the computer network, a plurality 
of computing systems connected together via the 
computer network, wherein each computing system 
30 comprises one or more computing system parti- 
tions; 

configuring, within the computer network, at least 
one redundancy group, comprising at least one 
computing system and at a plurality of computing 
35 system partitions; and 

managing at least one task using the database 
schema within the computing system and comput- 
ing system partitions with the redundancy group. 

40 [0009] According to a third aspect, the present inven- 
tion resides in a method for providing database access, 
comprising the steps of: 

operating at least one computing system within a 
45 network, the computing system containing at least 
one computing system partition and the computing 
system being a member of a redundancy group, 
wherein the computing system partition includes at 
least one instance of an application, at least one 
50 computing system node, and at least one copy of a 
database schema, the copies of the database sche- 
ma being replicated at each computing system par- 
tition within a network; and 

55 managing the replication of the database schema 

within the computing system and computing system 
partitions within the redundancy group. 



2 



BNSDOCID: <EP 0992909A2J_> 



3 



EP 0 992 909 A2 



4 



[0010] An embodiment of the present invention will 
now be described by way of reference to the accompa- 
nying drawings in which: 

[001 1] Referring now to the drawings in which like ref- 
erence numbers represent corresponding parts 
throughout: 

FIG. 1 is a block diagram that illustrates an exem- 
plary hardware environment that could be used with 
the present invention; 

FIG. 2 illuWates'the computing system partition ^o 
the present invention; 

FIG. 3 illustrates the redundancy strategy of the 
present invention; 

FIG. 4 illustrates a model of the computer architec- 
ture of the present invention; 
FIG. 5 illustrates replication of the database using 
the present invention; 

FIG. 6 illustrates temporal consistency of the data- 
base that is propagated by the present invention; 
and 

FIG. 7 is a flowchart that illustrates exemplary logic 
performed by the controller according to the present 
invention. 

Overview 

[0012] The present invention discloses a method, ap- 
paratus, and article of manufacture for distributing com- 
puter resources in a network environment. 
[001 3] At least one redundancy group and at least one 
computing system partition within the redundancy group 
are defined. The redundancy group comprises selected 
computer resources within the network environment. 
The selected computer resources include at least one 
application: at least one computer node, and at least one 
copy of a database. The computing system partitions 
comprise a subset of the selected computer resources 
within the network environment. The subset of the se- 
lected computer resources including at least one appli- 
cation, at least one computing node, and using only one 
copy of the database within the network environment. 
Tasks are performed within the network using the first 
computing system partition to perform the task. 
[001 4] Control for the detection of reassigning the re- 
cipient of the updates can occur upon hardware or soft- 
ware problems with the first assignee, or to allow the 
first assignee to be taken out of service for maintenance 
purposes. This control is provided by a combination of 
software systems operating on each of the networked 
computer systems, and can also be provided on exter- 
nal computers called Control Computers. The software 
on the networked computer and control computer to- 
gether determine the status of each of the networked 
computers to determine when to reassign the recipient 
computer, and rf so, which of the networked computers 
should receive the database updates. The determina- 
tion is achieved by using periodic messages, time-out 



values, and retry counts between the software on the 
networked computers and the control computers. 

Hardware Environment 

5 

[0015] FIG. 1 is an exemplary hardware environment 
used to implement the preferred embodiment of the in- 
vention. The present invention is typically implemented 
using a plurality of computing systems 100A-100D, 

10 each of which generally includes, inter alia, a processor, 
random access memory (RAM); data storage devices 
(e.g., hard, floppy, and/or CD-ROM disk drives, etc.), da- 
ta communications devices (e.g., modems, network in- 
terfaces, etc.), monitor (e.g., CRT, LCD display, holo- 

is graphic display device, etc.), etc. 

[0016] The computing systems 100A-100D are cou- 
pled together via network 102 and comprise a redun- 
dancy group 104. Each computing system 100A-D fur- 
ther comprises one or more computing system parti- 

20 tions, which are described in further detail in FIGS. 24. 
In addition, management centers 106 A and 106B can 
be coupled to network 102. Management centers 106 A 
and 1 06B are representative only; there can be a greater 
or lesser number of management centers 1 06 in the net- 

2S work 102. Further, there can be a greater or lesser 
number of computing systems 1 0OA-1 00D connected to 
the network 102 : as well as a greater or lesser number 
of computing systems 100A-D within the redundancy 
group 104. 

30 [0017] The present invention also teaches that any 
combination of the above components, or any number 
of different components, including computer programs, 
peripherals, and other devices, may be used to imple- 
ment the present invention, so long as similar functions 

3$ are performed thereby. The presentation of the compu- 
ter system as described in FIG. 1 is not meant to limit 
the scope of the present invention, but to illustrate one 
possible embodiment of the present invention. 

40 Relationships and Operation 

[0018] FIG. 2 further illustrates the components within 
a computing system of the present invention. Within the 
computing systems 1 00A-D are one or more computing 

45 system partitions (CSPs) 202. Each CSP 202 is coupled 
to only one copy of a database 204. The computing sys- 
tems 100A-D are coupled together via network 102. 
[0019] Management center computer 1 06A (or, alter- 
natively, 1 06B) can be used to control the flow of data 

so from the database copies 204 and updates to the com- 
puter systems 100A-100D. The database 204 can also 
be controlled directly from computer systems 1 00A-D if 
desired. 

[0020] Each copy of the database 204 is associated 
55 with a computing system partition (CSP) 202. As shown 
in FIG. 2, each computer system 100A-D can have one 
or more CSP's 202 resident within a computer system, 
as illustrated in computer system 10OA. 
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[0021] A redundancy, group 104 is a collection of 
Computing System Partitions 202 (CSPs) collaborating 
in an actively redundant fashion on a specific workload 
using a single replicated database 204 schema. The 
CSPs 202 may be resident on a single node computer 
system 100, a multi-node computer system 100, or on 
selected subsets of computing nodes from one or more 
multi-node computer systems 100. Each CSP 202 has 
an independent database copy of the database 204 for 
the redundancy group 104. The definition for a CSP 202 
is that- set of -computing resources-using a single copy 
of the replicated database 204. 

[0022] The fundamental component of a CSP 202 is 
a single computing node executing an independent 
copy of an operating system. However, CSP 202 may 
consist of multiple nodes and, therefore, multiple oper- 
ating system instances. The operating system operating 
on each CSP 202 can be different, e.g., one CSP 202 
may be using Windows, while another CSP 202 uses 
Unix, etc. An operating system instance may be a par- 
ticipant in one and only one redundancy group 104, 
meaning that the computing nodes comprising a CSP 
202 are "owned" by that redundancy group 1 04. A multi- 
node system can have different nodes participating in 
different redundancy groups 104, but there must be no 
'overlap between redundancy groups 104. 
[0023] To synchronize and replicate the database 204 
between the computer systems 100A-100D, one ofthe 
computer systems 100A-D is responsible for receiving 
direct updates of the database 204 via network 1 02 and 
disseminating or replicating those updates of database 
204 to the remaining computing systems 100. 
[0024] As an example, computer system 1 00B can be 
designated as the recipient ofthe direct updates to da- 
tabase 204. Once the updates are received by computer 
system 1 00B, computer system 1 00B then sends a copy 
ofthe database 204 with updates to computer systems 
100A, 100C, and 100D via network 102. This process 
continues until computer system 100B has sent a copy 
of database with updates to all computer systems within 
the network 102. 

[0025] If computer system 1 00B is unavailable, the re- 
sponsibility of replicating the database and updates 
shifts to another computer in the network 102. As an ex- 
ample, if computer system 100B is unavailable, the da- 
tabase replication responsibility shifts to computer sys- 
tem 100C, which then receives direct updates. Compu- 
ter system 1 00C then replicates the database and up- 
dates to computer systems 100A and 100D. Computer 
system 100C continues the replication until all comput- 
ers in the network 102 receive copies of the database 
and updates. 

Redundancy Strategy 

[0026] FIG. 3 illustrates the hierarchical redundancy 
strategy of the present invention. To effectively perform 
the replication of the database 204 and the updates as 



described in FIG. 2, the present invention partitions the 
network 102 into redundancy groups 104. Each redun- 
dancy group 104 is comprised of computing systems 
100, computing system partitions 202, application in- 
s stances 302, computing system nodes 304, and data- 
base copy 306. Database copy 306 is comprised of da- 
tabase 204 and updates. 

[0027] Typical networks 1 02 have multiple redundan- 
cy groups 104. The relationship between redundancy 

10 groups 104 is somewhat limited, but all redundancy 
groups 104 can participate in a global network 102, and 
a global administration view is typically used for such a 
network 102. In general, however, redundancy groups 
104 are envisioned to be mostly independent of each 

is other and constructed for the purposes of application- 
level independence, administrative flexibility, or the abil- 
ity to use computing systems 1 00 of modest capabilities. 
[0028] The redundancy group 1 04 is the fundamental 
factor of service availability and scalable query pertorm- 

20 ance. The present invention uses the redundancy group 
104 to reduce or eliminate a server service outage so 
long as at least one CSP 202 in the redundancy group 
104 is fully operational. The present invention also uses 
the redundancy group 104 to scale query performance 

25 beyond that attainable with just one computing system 
partition 104 and one copy of the database 306. Query 
performance and availability scale as CSP's 202 are 
added to a redundancy group. With standard comput- 
ers, as performance goes up, availability typically goes 

30 down. The present invention allows both availability and 
query performance for data warehousing systems to 
both go up simultaneously. 

[0029] Redundancy groups 104 of the present inven- 
tion accommodate the condition in which CSPs 202 ar- 

35 bitrariiy undergo exit and re introduction scenarios, but 
a sufficiently configured redundancy group 1 04 does not 
cease proper functionality. The limits of redundancy 
group functionality and database 204 access is limited 
by scenarios outside of the control of the computer sys- 

40 tern, e.g., unplanned hardware or software malfunc- 
tions, etc. 

Computer Architecture Model 

46 [0030] FIG. 4 illustrates a model of the computer ar- 
chitecture of a computing system partition of the present 
invention. The architecture model 400 has three signif- 
icant environments: the management environment 402, 
the run-time environment 404, and the hardware envi- 

50 ronment 406. The management environment 402 is il- 
lustrated as redundancy group management. The run- 
time environment 404 comprises the software compo- 
nents that provide application services directly or indi- 
rectly, which is the majority of the components in the 

55 model 400. The hardware environment 406 is depicted 
as the hardware platform, e.g., computer network 102, 
and peripherals. 

[0031] Redundancy group management 402 com- 
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prises of the tools, utilities and services necessary to 
administer, supervise and provide executive control 
over elements of a redundancy group 104. The compo- 
nents within the redundancy group management 402 
environment include redundancy group administration 
408, redundancy group supervision 410, redundancy 
group execution 412. 

[0032] The redundancy group administration 408 
component provides tools for definition, configuration, 
and operations of a redundancy group 104. These tools 
communicate with other tools" that provide administra- 
tive control of product specific components. Operations 
include facilities to startup, shutdown, install, and/or up- 
grade elements of redundancy groups 104. Included in 
the upgrade and install categories are special facilities 
necessary for verification. Included in the definition and 
configuration capabilities are defining policies and pro- 
cedures to be used by both humans and machines. Ad- 
ditionally, it is foreseen that advanced utilities to deter- 
mine the scope of failures and subsequently identify re- 
covery procedures would be in this component. 
The tools of this environment are expected to execute 
in a management center facility. 

[0033] The redundancy group supervision 410 com- 
ponent provides those services that monitor the health 
of a redundancy group 104. Included are the services 
for status request handling, heartbeat setup and moni- 
toring, and failure detection. 

[0034] The redundancy group execution 412 compo- 
nent provides those executive services that manage 
and control the work load of a redundancy group. Includ- 
ed are those services that provide transaction and re- 
quest-level load balancing and reconfiguration. This 
component manages and controls the workload of nor- 
mal transactions as well as recovery requests. 

Run-time Environment 

[0035] The run-time environment 404 comprises the 
services necessary to support application programs 
within redundancy groups 104. The components of the 
run-time environment 404 include application execution 
services 414, applications 416, communications re- 
source services 418, global transaction services 420, 
shared resource services 422, database replication 
services 424, file i/o426, remote storage services 428, 
and network services 430. These components fall into 
two categories, 1 ) those components typically utilized 
by applications directly, and 2) those components typi- 
cally utilized by applications indirectly. Services that fall 
into the second category are used by those services in 
the first category. 

[0036] Application execution services 414 provide 
pre- and post-processing on behalf of an application 
416. Such services include application instantiation, pa- 
rameter marshaling, and queue access services. Appli- 
cation execution services 414 also inform the applica- 
tion 416 of the status of a given transaction request and 



its disposition; for example, whether it is a normal trans- 
action request, a recovery request, or whether the re- 
quest is a request to startup or shutdown the application . 
Application execution services 414 also include servic- 
5 es necessary to communicate to redundancy group 
management 402 components. Additionally, application 
execution services 414 handle application 416 error sit- 
uations. 

[0037] Applications 416 are services to the consum- 
10 ers of a system (network 102), and are composed of 
software-components. Applications 416 are reduced in 
complexity by leveraging other services in a rich oper- 
ating environment, such as application 416 execution 
services 414 and shared resource services 422, since 
1$ these other services supply needed levels of transpar- 
ency. 

[0038] The communication resource services 418 
component comprises services that provide application 
416-to-application 416 communications within redun- 
20 dancy groups. 

[0039] The global transaction services 420 compo- 
nent provides services to maintain transaction context 
and to coordinate transaction integrity procedures and 
protocols. These services include facilities for an appli- 
es cation 416 to query the global transaction status, and 
commit or abort transactions. 

[0040] The shared resource services 422 component 
is a general container for services that provide access 
to shared resources. In a redundancy group 104 the 

30 shared resources of interest are replicated databases 
204, and, therefore, database 204 access services re- 
side in the shared resource services 422 component. 
Database 204 access services include services that 
provide the capability to create, read, write, rewrite, and 

35 delete data within a replicated database. 

[0041] Database replication services 424 fall into the 
indirect class of application 41 6 services. The database 
replication services 424 propagate database updates 
transparently to all copies of the database 204 in a re- 

40 dundancy group 1 04. There are primarily two database 
204 replication models, as described in the discussion 
relating to FIG. 5. 

[0042] File i/o services 426 are not utilized directly by 
customer applications 416, but are provided for use by 
45 system software components requiring non-transac- 
tional, persistent data storage and access services. File 
i/o is typically used for logging or journaling functions, 
event capture, software executables, and data inter- 
change files. 

so [0043] Remote storage services 428 allow a given file 
update request to be processed at locations remote 
from the location ofthefile i/o request, enabling file rep- 
lication. System components that take advantage of 
these services are those that require non-transactional 

55 access to queues, logs and system files that would be 
inappropriate for storage in an database. 
[0044] Network services 430 include those services 
that provide high performance, highly reliable transport 
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of messages. Of specific interest are those services that 
provide multi-casting of messages which results in an 
optimal and guaranteed delivery of messages to all des- 
tinations in a specified domain of receivers. This com- 
ponent also benefits applications indirectly, e.g., cus- 
tomer applications would not call the interface that initi- 
ates these services. Rather, these services would be 
provided to the application 41 6 through communications 
resource services 41 B. 

[0045] Network platform 406 is the computing hard- 
ware, e.g.rnetwork 102, that is used for executing the 
instructions associated with the application 416, etc. 

Database Replication Schemes 

[0046] FIG. 5 illustrates replication of the database 
using the present invention. Within network 424, repli- 
cation schemes 500 and 502 can be utilized to replicate 
database 204. Either replication scheme 500 or replica- 
tion scheme 502, or both, can be used within network 
424, depending on the architecture of the redundancy 
groups 1 04. 

[0047] Database 204 replication is the synchroniza- 
tion mechanism between the database 204 copies in a 
redundancy group 104. The present invention could al- 
so utilize transaction-level replication (reprocessing the 
entire application transaction on each participating sys- 
tem) instead of entire database 204 replication, but the 
discussion relating to database 204 replication applies 
equally well to transaction-level replication. References 
herein relatingto database 204 replication include trans- 
action-level replication. 

[0048] At least two distinct database 204 replication 
models are supported by the present invention, peer/ 
peer replication model 500 and primary/subscriber rep- 
lication model 502. Other database replication models 
are envisioned, but the discussion herein is limited to 
the two models 500 and 502. The peer/peer replication 
model 502 update transactions are processed on any 
logical system in a redundancy group 104. Inter-copy 
database 204 consistency and serializability are main- 
tained either through global network 102 concurrency 
controls 504, or through commit certifications that occur 
within the redundancy group 104. 
[0049] In the primary/subscriber replication model 
502, all update transactions are routed to a single logical 
system, e.g., computer system 100A-D, in the redun- 
dancy group 104, called the primary system, which 
propagates updates to the other logical systems, e.g., 
computer systems 100A-D, after the commitment of a 
transaction is complete. The update transaction routing 
is performed transparently and automatically When, the 
primary logical system, e.g., computer system 100A, ex- 
its the redundancy group 104 (for reasons of failure or 
scheduled downtime) a new primary system is selected. 
See the discussion relating to FIG. 2. 
[0050] FIG. 6 illustrates temporal consistency of the 
database that is propagated by the present invention. 



Within either replication model 500 or 502, the database 
204 will have temporal inconsistencies because time is 
required to update the database 204 on each of the net- 
work 102 computers within a redundancy group 104. 

s Update propagation in replicated database 204 
processing has a side effect in that a trade-off must be 
made between update efficiency and the temporal con- 
sistency of thedatabase 204 copies in the redundancy 
group 104. It is possible to synchronize the database 

10 204 copies by propagating updates before the comple- 
tion of an update transaction, e:g., before releasing da- 
tabase 204 locks and allowing commit processing to 
complete. However, absolute synchronization requires 
propagation protocols that are complex and expensive 

is from a computing perspective. 

[0051] The present invention allows the database 204 
copies to deviate from each other in a temporal sense, 
and restrict consistency constraints to serializability and 
transaction-level atomicity. The approach of the present 

20 invention prevents any copy of the database 204 from 
having "dirty data," "partial updates," or out-of-order up- 
dates, but the timing ofthe appearance of the updates 
from a given transaction in any particular database 204 
copy will be delayed to an unpredictable degree. The 

2& temporal deviation between the database 204 copies 
will be dependent on numerous factors including hard- 
ware utilization, instantaneous transaction mix, and net- 
work 1 02 latency 

The effects of inter-copy temporal inconsistency can be 
30 mitigated with numerous application processing tech- 
niques, including restriction of updates to selected time 
windows (during which queries may be restricted), clev- 
er partitioning of the query processing workload, and 
clever partitioning and/or clustering of user queries to 
35 specific database copies. 

[0052] For a single replicated database schema, 
shown in replication model 502, each actively redundant 
configuration will support only one replicated database 
schema because oftransaction-level consistency con- 
40 straints. 

Logic of the Database Replicator 

[0053] FIG. 7 is a flowchart that illustrates exemplary 
45 logic performed by the controller according to the 
present invention. 

[0054] Block 700 represents the computer 100 as- 
sembling, as part of a computer network, at leasl one 
computing system partition, including at least one in- 
so stance of an application, at least one computing system 
node, and at least one copy of a database schema, the 
copies of the database schema being replicated at each 
computing system partition within the computer net- 
work. 

55 [0055] Block 702 represents the computer* 100 con- 
figuring, within the computer network, a plurality of com- 
puting systems connected together via the computer 
network, wherein each computing system comprises 
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one or more computing system partitions. 
[0056] Block 704 represents the computer 100 con- 
figuring, within the computer network, at least one re- 
dundancy group, comprising at least one computing 
system and at a plurality of computing system partitions. 
[0057] Block 706 represents the computer 100 man- 
aging at least one task using the database schema with- 
in the computing system and computing system parti- 
tions with the redundancy group. 

Conclusion 

[0058] This concludes the description of the preferred 
embodiment of the invention The following describes 
some alternative embodiments for accomplishing the 
present invention. For example, any type of computer, 
such as a mainframe, minicomputer, or personal com- 
puter, could be used with the present invention. In addi- 
tion, any software program utilizing (either partially or 
entirely) a database could benefit from the present in- 
vention. 

[0059] The foregoing description of the preferred em- 
bodiment of the invention has been presented for the 
purposes of illustration and description. It is not intended 
to be exhaustive or to limit the invention to the precise 
form disclosed. Many modifications and variations are 
possible in light of the above teaching. It is intended that 
the scope of the invention be limited not by this detailed 
description, but rather by the claims appended hereto 



Claims 



2. The system of claim 1. wherein the redundancy 
group defines a first computing system as the com- 
puting system that replicates the copy of the data- 
base to all other computing systems within the re- 

5 dundancy group. 

3. The system of claim 2, wherein the redundancy 
group defines a second computing system within 
the redundancy group for replicating the copy of the 

10 database to all other computing systems within the 
-redundancy group when the first computing system 
is unavailable. 



A system for providing continuous database ac- 
cess, comprising: 

at least one computing system partition, includ- 
ing at least one instance of an application, at 
least one computing system node, and at least 
one copy of a database schema, the copies of 
the database schema being replicated at each 
computing system partition within a network, 
and wherein each computing system partition 
manages interactions between the instances, 
the computing system nodes, and the copy of 
the database schema within the respective 
computing system partition; 
a plurality ol computing systems connected to- 
gether via the network, wherein each comput- 
ing system comprises one or more computing 
system partitions; 

at least one redundancy group, comprising at 
least one computing system and a plurality of 
computing system partitions, wherein each re- 
dundancy group manages the replication of the 
database schema within the computing system 
and computing system partitions within the re- 
dundancy group. 



4. The system of claim 1, wherein the redundancy 
is group can be redefined to include a different set of 

computing systems. 

5. The system of claim 1 , wherein the computing sys- 
tem partition can be redefined to include different 

20 instances of the application and different computing 
system nodes. 

6. The system of claim I , wherein at least one comput- 
ing system partition can be removed from the re- 

25 dundancy group. 

7. The system of claim 6, wherein the computing sys- 
tem partitions that are removed from the redundan- 
cy group can be added to a second redundancy 

30 group. 

8. The system of claim 1 , wherein a status of the com- 
puting system partitions is monitored by the redun- 
dancy group. 



35 



40 



45 



SO 



55 



9. The system of claim 8, wherein a task that is as- 
signed to a first computing system partition is reas- 
signed to a second computing system partition by 
the redundancy group. 

10. A method for distributing computer resources in a 
network environment, comprising the steps of: 

assembling, as part of a computer network, at 
least one computing system partition, including 
at least one instance of an application, at least 
one computing system node, and at least one 
copy of a database schema, the copies of the 
database schema being replicated at each 
computing system partition within the computer 
network; 

configuring, within the computer network, a plu- 
rality of computing systems connected together 
via the computer network, wherein each com- 
puting system comprises one or more comput- 
ing system partitions; 

configuring, within the computer network, at 
least one redundancy group, comprising at 
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least one computing system and at a plurality 
of computing system partitions; and 
managing at least one task using the database 
schema within the computing system and com- 
puting system partitions with the redundancy 
group. 

1 1 . The method of claim 1 0, wherein the task is a data- 
base replication within the computer network. 

12. The method of claim 11 , wherein the task of data- 
base replication is performed by a first computing 
system partition within the redundancy group. 

1 3. The method of claim 1 2, wherein the task of data- 
base replication is performed by a second comput- 
ing system partition within the redundancy group 
when the first computing system partition is unavail- 
able. 

14. The method of claim 10, wherein the redundancy 
group can be redefined to include a different set of 
computing systems. 

15. The method of claim 10, wherein the computing 
system partition can be redefined to include a dif- 
ferent subset of computing system nodes and in- 
stances of the application. 

16. The method of claim 10, wherein a first computing 
system partition can be removed from the redun- 
dancy group. 

17. The method of claim 16, wherein the first computing 
system partition can be added to a second redun- 
dancy group. 

1 8. The method of claim 1 0, further comprising the step 
of monitoring a status of the computing system par- 
titions by the redundancy group. 

19. The method of claim 18, wherein a task that is as- 
signed to a first computing system partition is reas- 
signed to a second computing system partition by 
the redundancy group. 

20. A method for providing database access, compris- 
ing the steps of: 

operating at least one computing system within 
a network, the computing system containing at 
least one computing system partition and the 
computing system being a member of a redun- 
dancy group, wherein the computing system 
partition includes at least one instance of an ap- 
plication, at least one computing system node, 
and at least one copy of a database schema, 
the copies of the database schema being rep- 



licated at each computing system partition with- 
in a network; and 

managing the replication of the database sche- 
ma within the computing system and computing 
s system partitions within the redundancy group. 
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